Abstract

In a recent paper the author obtained optimal bounds for the strong Gaussian
approximation of sums of independent $\mathbf{R}^d$-valued random vectors with
finite exponential moments. The results may be considered as generalizations
of well-known results of Komlós-Major-Tusnády and Sakhanenko. The dependence
of constants on the dimension $d$ and on distributions of summands is given
explicitly. Some related problems are discussed.
1. Introduction

Let $X_1, \dots, X_n, \dots$ be mean zero independent $\mathbf{R}^d$-valued random vectors and
$\mathbb{D}_n = \operatorname{cov} S_n$ the covariance operator of the sum $S_n = \sum_{i=1}^{n} X_i$. By the Central
Limit Theorem, under some simple moment conditions the distribution of the normalized
sums $\mathbb{D}_n^{-1/2} S_n$ is close to the standard Gaussian distribution. The invariance
principle states that, in a sense, the distribution of the whole sequence
$\mathbb{D}_n^{-1/2} S_1, \dots, \mathbb{D}_n^{-1/2} S_n, \dots$ is close to the distribution of the sequence
$\mathbb{D}_n^{-1/2} T_1, \dots, \mathbb{D}_n^{-1/2} T_n, \dots$, where $T_n = \sum_{i=1}^{n} Y_i$ and $Y_1, \dots, Y_n, \dots$ is a
corresponding sequence of independent Gaussian random vectors (this means that $Y_i$ has
the same mean and the same covariance operator as $X_i$, $i = 1, \dots, n, \dots$).

We consider here the problem of strong approximation, which is more delicate
than that of estimating the closeness of distributions. It is required to construct on
a probability space a sequence of independent random vectors $X_1, \dots, X_n$ (with
given distributions) and a corresponding sequence of independent Gaussian random
vectors $Y_1, \dots, Y_n$ so that the quantity

$$\Delta(X, Y) = \max_{1 \le k \le n} \Bigl\| \sum_{j=1}^{k} X_j - \sum_{j=1}^{k} Y_j \Bigr\|$$

is as small as possible with large probability. Here $\|\cdot\|$ is the Euclidean
norm. It is clear that vectors, even with the same distributions, can be very far
from one another.

In some sense this problem is one of the most important in probability
approximations because many well-known probability theorems can be considered as
consequences of results about strong approximation of sequences of sums by
corresponding Gaussian sequences. This is related to the law of the iterated logarithm, to
several theorems about large deviations, to the estimates for the rate of convergence
of the Prokhorov distance in the invariance principles (Prokhorov [19], Skorokhod
[26], Borovkov [4]), as well as to the Strassen-type approximations (Strassen [28],
see, for example, Csörgő and Hall [8]).

The rate of strong approximation in the one-dimensional invariance principle
was studied by many authors (see, e.g., Prokhorov [19], Skorokhod [26],
Borovkov [4], Csörgő and Révész [6] and the bibliography in Csörgő and Révész [7],
Csörgő and Hall [8], Shao [20]). Skorokhod [26] developed a method of constructing
close sequences of successive sums of independent random variables on the same
probability space. For a long time the best rates of approximation were obtained
by this method, known now as the Skorokhod embedding. However, Komlós, Major
and Tusnády (KMT) [17] elaborated a new, more powerful method of dyadic
approximation. With the help of this method they obtained optimal rates of Gaussian
approximation for sequences of independent identically distributed random
variables.

We restrict ourselves to the most important case, where the summands have
finite exponential moments. Sakhanenko [24] generalized and essentially sharpened
the KMT results in the case of non-identically distributed random variables. He
considered the following class of one-dimensional distributions:

$$\mathcal{S}_1(\tau) = \bigl\{ \mathcal{L}(\xi) : \mathbf{E}\,\xi = 0,\ \mathbf{E}\,|\xi|^3 \exp\bigl(\tau^{-1} |\xi|\bigr) \le \tau\, \mathbf{E}\,|\xi|^2 \bigr\}$$

(the distribution of a random vector $\xi$ will be denoted by $\mathcal{L}(\xi)$). His main result is
formulated as follows.

Theorem 1 (Sakhanenko [24]). Suppose that $\tau > 0$, and $\xi_1, \dots, \xi_n$ are
independent random variables with $\mathcal{L}(\xi_j) \in \mathcal{S}_1(\tau)$, $j = 1, \dots, n$. Then one can
construct on a probability space a sequence of independent random variables $X_1, \dots, X_n$
and a sequence of independent Gaussian random variables $Y_1, \dots, Y_n$ so that

$$\mathcal{L}(X_j) = \mathcal{L}(\xi_j), \qquad \mathbf{E}\,Y_j = 0, \qquad \mathbf{E}\,Y_j^2 = \mathbf{E}\,X_j^2, \qquad j = 1, \dots, n,$$

and

$$\mathbf{E} \exp\bigl( c\,\Delta(X, Y)/\tau \bigr) \le 1 + B/\tau, \tag{1.1}$$

where $c$ is an absolute constant and $B^2 = \mathbf{E}\,\xi_1^2 + \dots + \mathbf{E}\,\xi_n^2$.
KMT [17] supposed that $\xi, \xi_1, \dots, \xi_n$ are identically distributed and
$\mathbf{E}\, e^{\langle h, \xi \rangle} < \infty$ for $h \in V$, where $V \subset \mathbf{R}^d$ is some neighborhood of zero. The KMT (1975-76)
result follows from Theorem 1. It is easy to see that there exists $\tau(F)$ such that
$F = \mathcal{L}(\xi_j) \in \mathcal{S}_1(\tau(F))$. Applying the Chebyshev inequality, we observe that (1.1)
implies that

$$\mathbf{P}\bigl( c\,\Delta(X, Y)/\tau(F) > x \bigr) \le \exp\Bigl( \log\bigl( 1 + \sqrt{n\,\mathbf{E}\,\xi^2}\,/\tau(F) \bigr) - x \Bigr), \qquad x > 0. \tag{1.2}$$

Inequality (1.2) provides more information than the original KMT formulation,
which contains unspecified constants depending on $F$. In (1.2) the dependence
of constants on the distribution $F$ is written out in an explicit form. The quantity
$\tau(F)$ can be easily calculated or estimated for any concrete distribution $F$.
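
The passage from (1.1) to (1.2) is a single application of Markov's inequality to the exponential moment (the text refers to it as the Chebyshev inequality). The following sketch spells the step out for the identically distributed case, assuming $\mathcal{L}(\xi_j) = F \in \mathcal{S}_1(\tau(F))$ so that $B^2 = n\,\mathbf{E}\,\xi^2$; the shorthand $\tau = \tau(F)$ and $B = \sqrt{n\,\mathbf{E}\,\xi^2}$ is introduced here only for readability.

```latex
% Sketch: (1.1) implies (1.2) via Markov's inequality.
% Shorthand (not from the paper): \tau = \tau(F), B = \sqrt{n \mathbf{E}\,\xi^2}.
\begin{align*}
\mathbf{P}\bigl( c\,\Delta(X,Y)/\tau > x \bigr)
  &= \mathbf{P}\bigl( \exp( c\,\Delta(X,Y)/\tau ) > e^{x} \bigr) \\
  &\le e^{-x}\, \mathbf{E} \exp\bigl( c\,\Delta(X,Y)/\tau \bigr)
     && \text{(Markov's inequality)} \\
  &\le e^{-x} \bigl( 1 + B/\tau \bigr)
     && \text{(by (1.1))} \\
  &= \exp\Bigl( \log\bigl( 1 + \sqrt{n\,\mathbf{E}\,\xi^2}\,/\tau \bigr) - x \Bigr),
     \qquad x > 0 .
\end{align*}
```

Since $\log(1 + B/\tau)$ grows like a multiple of $\log n$ for fixed $F$, this bound says that $\Delta(X, Y)$ exceeds a quantity of order $\tau(F) \log n$ only with small probability.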

The first attempts to extend the KMT and Sakhanenko approximations to
the multidimensional case (see Berkes and Philipp [3], Philipp [18], Berger [2],
Einmahl [10, 11]) had only partial success. Comparatively recently U. Einmahl
[12] obtained multidimensional analogs of KMT results which are close to optimal.
Zaitsev [33, 34] removed an unnecessary logarithmic factor from the result of
Einmahl [12] and obtained multidimensional analogs of KMT results (see Theorem 2
below). In Theorem 2 the random vectors are, generally speaking, non-identically
distributed. However, they have the same identity covariance operator $\mathbb{I}$. Therefore,
the problem of obtaining an adequate multidimensional generalization of the main
result of Sakhanenko [24] remained open. This generalization is given in Theorem 3
below.

2. Main results 

For formulations of results we need some notation. Let $\mathcal{A}_d(\tau)$, $\tau \ge 0$, $d \in \mathbf{N}$,
denote classes of $d$-dimensional distributions introduced in Zaitsev [29]; see also
Zaitsev [33-35]. The class $\mathcal{A}_d(\tau)$ (with a fixed $\tau \ge 0$) consists of $d$-dimensional
distributions $F$ for which the function

$$\varphi(z) = \varphi(F, z) = \log \int_{\mathbf{R}^d} e^{\langle z, x \rangle} F\{dx\} \qquad (\varphi(0) = 0)$$

is defined and analytic for $\|z\|\,\tau < 1$, $z \in \mathbf{C}^d$, and

$$\bigl| d_u d_v^2 \varphi(z) \bigr| \le \|u\|\,\tau\, \langle \mathbb{D} v, v \rangle$$

for all $u, v \in \mathbf{R}^d$ and $\|z\|\,\tau < 1$, where $\mathbb{D} = \operatorname{cov} F$ is the covariance operator corresponding
to $F$, and $d_u \varphi$ is the derivative of the function $\varphi$ in direction $u$.

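Theorem 2 (Zaitsev [33, 34]). Suppose that $\tau \ge 1$, $\alpha > 0$ and $\xi_1, \dots, \xi_n$
are random vectors with distributions $\mathcal{L}(\xi_k) \in \mathcal{A}_d(\tau)$, $\mathbf{E}\,\xi_k = 0$, $\operatorname{cov} \xi_k = \mathbb{I}$,
$k = 1, \dots, n$. Then one can construct on a probability space a sequence of independent
random vectors $X_1, \dots, X_n$ and a sequence of independent Gaussian random vectors
$Y_1, \dots, Y_n$ so that

$$\mathcal{L}(X_k) = \mathcal{L}(\xi_k), \qquad \mathbf{E}\,Y_k = 0, \qquad \operatorname{cov} \mathcal{L}(Y_k) = \mathbb{I}, \qquad k = 1, \dots, n,$$

and

$$\mathbf{E} \exp\Bigl( \frac{c_1(\alpha)\, \Delta(X, Y)}{\tau\, d^{7/2} \log^* d} \Bigr) \le \exp\Bigl( c_2(\alpha)\, d^{9/4+\alpha} \log^*(n/\tau^2) \Bigr),$$

where $c_1(\alpha)$, $c_2(\alpha)$ are positive quantities depending on $\alpha$ only and
$\log^* b = \max\{1, \log b\}$, for $b > 0$.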






A. Yu. Zaitsev 



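Corollary 1. In the conditions of Theorem 2, for all $x \ge 0$ the following
inequality is valid:

$$\mathbf{P}\Bigl\{ \Delta(X, Y) \ge \frac{c_2(\alpha)\, \tau\, d^{23/4+\alpha} \log^* d\, \log^*(n/\tau^2)}{c_1(\alpha)} + x \Bigr\} \le \exp\Bigl( - \frac{c_1(\alpha)\, x}{\tau\, d^{7/2} \log^* d} \Bigr).$$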
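
Corollary 1 can be read as the exponential-moment bound of Theorem 2 combined with Markov's inequality. The sketch below assumes that the inequality of Theorem 2 has the form $\mathbf{E} \exp\bigl( c_1(\alpha) \Delta(X,Y) / (\tau d^{7/2} \log^* d) \bigr) \le \exp\bigl( c_2(\alpha) d^{9/4+\alpha} \log^*(n/\tau^2) \bigr)$; the abbreviations $\lambda_0$ and $a$ are introduced here and do not appear in the paper.

```latex
% Abbreviations (for this sketch only):
%   \lambda_0 = c_1(\alpha) / (\tau d^{7/2} \log^* d),
%   a = c_2(\alpha)\, \tau\, d^{23/4+\alpha} \log^* d \, \log^*(n/\tau^2) / c_1(\alpha),
% so that \lambda_0 a = c_2(\alpha)\, d^{9/4+\alpha} \log^*(n/\tau^2),
% because 23/4 - 7/2 = 9/4.
\begin{align*}
\mathbf{P}\bigl\{ \Delta(X,Y) \ge a + x \bigr\}
  &\le e^{-\lambda_0 (a + x)}\, \mathbf{E} \exp\bigl( \lambda_0\, \Delta(X,Y) \bigr)
     && \text{(Markov's inequality)} \\
  &\le \exp\bigl( -\lambda_0 a - \lambda_0 x
       + c_2(\alpha)\, d^{9/4+\alpha} \log^*(n/\tau^2) \bigr)
     && \text{(Theorem 2)} \\
  &= \exp( -\lambda_0 x )
   = \exp\Bigl( - \frac{c_1(\alpha)\, x}{\tau\, d^{7/2} \log^* d} \Bigr),
     \qquad x \ge 0 .
\end{align*}
```

The threshold $a$ is chosen exactly so that the moment bound is cancelled, which explains why the exponent $23/4 + \alpha$ in the corollary is the sum of the exponents $7/2$ and $9/4 + \alpha$ from the theorem.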



It is easy to see that if $V \subset \mathbf{R}^d$ is some neighborhood of zero and $\mathbf{E}\, e^{\langle h, \xi \rangle} < \infty$
for $h \in V$, then $F = \mathcal{L}(\xi) \in \mathcal{A}_d(c(F))$. Below we list some simple and useful
properties of classes $\mathcal{A}_d(\tau)$ which are essential in the proof of Theorem 2. In the
one-dimensional case, Theorem 2 implies Sakhanenko's Theorem 1 for identically distributed
random variables with finite exponential moments, as well as the result of KMT [17].

Corollary 2. Suppose that a random vector $\xi$ has finite exponential moments
$\mathbf{E}\, e^{\langle h, \xi \rangle}$, for $h \in V$, where $V \subset \mathbf{R}^d$ is some neighborhood of zero. Then
one can construct on a probability space a sequence of independent random vectors
$X_1, X_2, \dots$ and a sequence of independent Gaussian random vectors $Y_1, Y_2, \dots$ so
that

$$\mathcal{L}(X_k) = \mathcal{L}(\xi), \qquad \mathbf{E}\,Y_k = 0, \qquad \operatorname{cov} Y_k = \operatorname{cov} \xi, \qquad k = 1, 2, \dots,$$

and

$$\sum_{k=1}^{n} X_k - \sum_{k=1}^{n} Y_k = O(\log n) \quad \text{a.s.}$$



As noted in KMT [17], it follows from the results of Bártfai [1] that the rate of
approximation in Corollary 2 is the best possible for non-Gaussian vectors $\xi$. An
analog of Corollary 2 was obtained by Einmahl [12] under additional smoothness-type
restrictions on the distribution $\mathcal{L}(\xi)$. The following statement is a sharpening
of Corollary 2.

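Corollary 3 (Zaitsev [36]). Suppose that a random vector $\xi$ has a distribution
such that $\mathcal{L}(\mathbb{D}^{-1/2} \xi) \in \mathcal{A}_d(\tau)$, where $\mathbb{D} = \operatorname{cov} \mathcal{L}(\xi)$ is an invertible operator.
Let $\sigma^2$, $\sigma > 0$, be the maximal eigenvalue of $\mathbb{D}$. Then for any $\alpha > 0$ there exists a
construction from Corollary 2 such that

$$\mathbf{P}\Bigl\{ \limsup_{n \to \infty} \frac{1}{\log n} \Bigl\| \sum_{k=1}^{n} X_k - \sum_{k=1}^{n} Y_k \Bigr\| \le c_3(\alpha)\, \sigma\, \tau\, d^{23/4+\alpha} \log^* d \Bigr\} = 1 \tag{2.1}$$

with $c_3(\alpha)$ depending on $\alpha$ only.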

In Theorem 2 and Corollary 3 we consider the case $\tau \ge 1$. The case of small
$\tau$ was investigated by Götze and Zaitsev [16]. It is shown there that, under additional
smoothness-type restrictions on the distribution $\mathcal{L}(\xi)$, the expression on the
right-hand side of the inequality in (2.1) can be arbitrarily small if the parameter $\tau$ is
small enough. It is clear that the statements of Theorem 2 and Corollary 3 become
stronger for small $\tau$. In Götze and Zaitsev [16] one can find simple examples in
which the rather complicated smoothness condition is satisfied. The approximation
is better in the case when summands have smooth distributions which are
close to Gaussian ones (see inequalities (3.1) and (3.2) below).

The following Theorem 3 is a generalization of Theorem 2 to the case of
multivariate random variables. In the one-dimensional situation, Theorem 3 implies
Theorem 1.

Theorem 3 (Zaitsev [35]). Suppose that $\alpha > 0$, $\tau \ge 1$, and $\xi_1, \dots, \xi_n$ are
independent random vectors with $\mathbf{E}\,\xi_j = 0$, $j = 1, \dots, n$. Assume that there exists
a strictly increasing sequence of non-negative integers $m_0 = 0, m_1, \dots, m_s = n$
satisfying the following conditions. Write

$$\zeta_k = \xi_{m_{k-1}+1} + \dots + \xi_{m_k}, \qquad k = 1, \dots, s,$$

and suppose that (for all $k = 1, \dots, s$) $\mathcal{L}(\zeta_k) \in \mathcal{A}_d(\tau)$, $\operatorname{cov} \zeta_k = \mathbb{B}_k$ and, for all
$u \in \mathbf{R}^d$,

$$c_4 \|u\|^2 \le \langle \mathbb{B}_k u, u \rangle \le c_5 \|u\|^2 \tag{2.2}$$

with some constants $c_4$ and $c_5$. Then one can construct on a probability space a
sequence of independent random vectors $X_1, \dots, X_n$ and a corresponding sequence
of independent Gaussian random vectors $Y_1, \dots, Y_n$ so that $\mathcal{L}(X_j) = \mathcal{L}(\xi_j)$, $\mathbf{E}\,Y_j = 0$,
$\operatorname{cov} \mathcal{L}(Y_j) = \operatorname{cov} \mathcal{L}(X_j)$, $j = 1, \dots, n$, and

$$\mathbf{E} \exp\Bigl( \frac{a_1\, \Delta(X, Y)}{\tau\, d^{9/2} \log^* d} \Bigr) \le \exp\Bigl( a_2\, d^{3+\alpha} \log^*(s/\tau^2) \Bigr),$$

where $a_1$, $a_2$ are positive quantities depending only on $\alpha$, $c_4$, $c_5$.



3. Properties of classes $\mathcal{A}_d(\tau)$

Let us consider elementary properties of classes $\mathcal{A}_d(\tau)$ which are essentially
used in the proof of Theorems 2 and 3; see Zaitsev [29, 31, 33-35]. It is easy
to see that $\tau_1 < \tau_2$ implies $\mathcal{A}_d(\tau_1) \subset \mathcal{A}_d(\tau_2)$. Moreover, the class $\mathcal{A}_d(\tau)$ is
closed with respect to convolution: if $F_1, F_2 \in \mathcal{A}_d(\tau)$, then $F_1 F_2 = F_1 * F_2 \in \mathcal{A}_d(\tau)$.
Products of measures are understood in the convolution sense. Note that the
condition $\mathcal{L}(\zeta_k) \in \mathcal{A}_d(\tau)$ in Theorem 3 is satisfied if $\mathcal{L}(\xi_j) \in \mathcal{A}_d(\tau)$, for $j = 1, \dots, n$.
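
The closedness under convolution can be checked directly from the definition in Section 2. The following sketch assumes the defining inequality in the form $|d_u d_v^2 \varphi(z)| \le \|u\|\,\tau\,\langle \mathbb{D} v, v \rangle$ for $\|z\|\,\tau < 1$; the symbols $\mathbb{D}_1$, $\mathbb{D}_2$ for the covariance operators of $F_1$, $F_2$ are introduced here for illustration.

```latex
% F_1, F_2 \in \mathcal{A}_d(\tau) with covariance operators \mathbb{D}_1, \mathbb{D}_2
% (notation of this sketch). For a convolution, the logarithm of the
% moment generating function and the covariance operator are both additive:
\begin{align*}
\varphi(F_1 F_2, z) &= \varphi(F_1, z) + \varphi(F_2, z), \qquad
\operatorname{cov}(F_1 F_2) = \mathbb{D}_1 + \mathbb{D}_2, \\
\bigl| d_u d_v^2 \varphi(F_1 F_2, z) \bigr|
  &\le \bigl| d_u d_v^2 \varphi(F_1, z) \bigr|
     + \bigl| d_u d_v^2 \varphi(F_2, z) \bigr| \\
  &\le \|u\|\,\tau\, \langle \mathbb{D}_1 v, v \rangle
     + \|u\|\,\tau\, \langle \mathbb{D}_2 v, v \rangle
   = \|u\|\,\tau\, \bigl\langle (\mathbb{D}_1 + \mathbb{D}_2) v, v \bigr\rangle ,
   \qquad \|z\|\,\tau < 1 .
\end{align*}
```

Analyticity in the region $\|z\|\,\tau < 1$ is preserved by the sum, so $F_1 F_2$ satisfies the definition with the same $\tau$. The key point is that the right-hand side of the defining inequality is linear in the covariance operator.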

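Let $\tau \ge 0$, $F = \mathcal{L}(\xi) \in \mathcal{A}_d(\tau)$, $y \in \mathbf{R}^m$, and let $A : \mathbf{R}^d \to \mathbf{R}^m$ be a linear operator.
Then

$$\mathcal{L}(A\xi + y) \in \mathcal{A}_m(\|A\|\,\tau), \qquad \text{where } \|A\| = \sup_{x \in \mathbf{R}^d,\ \|x\| \le 1} \|Ax\| .$$

Suppose that $\tau \ge 0$, $F_k = \mathcal{L}(\xi^{(k)}) \in \mathcal{A}_{d_k}(\tau)$, and the vectors $\xi^{(k)}$, $k = 1, 2$, are
independent. Let $\xi \in \mathbf{R}^{d_1+d_2}$ be the vector with the first $d_1$ coordinates coinciding
with those of $\xi^{(1)}$ and with the last $d_2$ coordinates coinciding with those of $\xi^{(2)}$.
Then $F = \mathcal{L}(\xi) \in \mathcal{A}_{d_1+d_2}(\tau)$.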
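
The behaviour of the classes under affine maps, $\mathcal{L}(A\xi + y) \in \mathcal{A}_m(\|A\|\,\tau)$, can be traced through the definition as follows. This is a sketch under two assumptions made here: the definition is taken in the form $|d_u d_v^2 \varphi(z)| \le \|u\|\,\tau\,\langle \mathbb{D} v, v \rangle$ for $\|z\|\,\tau < 1$, and $\langle z, x \rangle = \sum_j z_j x_j$ is bilinear for complex $z$. The symbols $G$ and $A^{T}$ (the transpose of $A$) are notation introduced for this sketch.

```latex
% G = \mathcal{L}(A\xi + y), F = \mathcal{L}(\xi), \mathbb{D} = \operatorname{cov} F, z \in \mathbf{C}^m.
\begin{align*}
\varphi(G, z) &= \log \mathbf{E} \exp \langle z, A\xi + y \rangle
   = \langle z, y \rangle + \varphi(F, A^{T} z),
   \qquad \operatorname{cov} G = A\, \mathbb{D} A^{T}, \\
d_u d_v^2 \varphi(G, z)
  &= \bigl( d_{A^{T} u}\, d_{A^{T} v}^2\, \varphi \bigr)(F, A^{T} z)
   && \text{(the linear term } \langle z, y \rangle \text{ drops out)} \\
\bigl| d_u d_v^2 \varphi(G, z) \bigr|
  &\le \|A^{T} u\|\,\tau\, \langle \mathbb{D} A^{T} v, A^{T} v \rangle
   \le \|u\|\, \bigl( \|A\|\,\tau \bigr)\, \langle A\, \mathbb{D} A^{T} v, v \rangle .
\end{align*}
% The bound for F is applicable because \|A^{T} z\|\,\tau \le \|z\|\,\|A\|\,\tau < 1.
```

The parameter $\|A\|\,\tau$ therefore arises twice: once from the admissible region for $z$ and once from the factor $\|A^{T} u\| \le \|A\|\,\|u\|$.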

The classes $\mathcal{A}_d(\tau)$ are closely connected with other naturally defined classes
of multidimensional distributions. From the definition of $\mathcal{A}_d(\tau)$ it follows that if
$\mathcal{L}(\xi) \in \mathcal{A}_d(\tau)$ then the vector $\xi$ has finite exponential moments $\mathbf{E}\, e^{\langle h, \xi \rangle} < \infty$, for
$h \in \mathbf{R}^d$, $\|h\|\,\tau < 1$. This leads to exponential estimates for the tails of distributions.

The condition $\mathcal{L}(\xi) \in \mathcal{A}_1(\tau)$ is equivalent to Statulevičius' [27] conditions on
the rate of increase of cumulants $\gamma_m$ of the random variable $\xi$:

$$|\gamma_m| \le \tfrac{1}{2}\, m!\, \tau^{m-2} \gamma_2, \qquad m = 3, 4, \dots$$

This equivalence means that if one of these conditions is satisfied with parameter $\tau$,
then the second is valid with parameter $c\tau$, where $c$ denotes an absolute constant.
However, the condition $\mathcal{L}(\xi) \in \mathcal{A}_d(\tau)$ differs essentially from other multidimensional
analogs of Statulevičius' conditions, considered by Rudzkis [23] and Saulis [25].
Zaitsev [30] considered classes of distributions

$$\mathcal{B}_d(\tau) = \Bigl\{ F = \mathcal{L}(\xi) : \mathbf{E}\,\xi = 0,\ \bigl| \mathbf{E}\, \langle \xi, v \rangle^2 \langle \xi, u \rangle^{m-2} \bigr| \le \tfrac{1}{2}\, m!\, \tau^{m-2} \|u\|^{m-2}\, \mathbf{E}\, \langle \xi, v \rangle^2 \ \text{for all } u, v \in \mathbf{R}^d,\ m = 3, 4, \dots \Bigr\}$$

satisfying multidimensional analogs of the Bernstein inequality condition.
Sakhanenko's condition $\mathcal{L}(\xi) \in \mathcal{S}_1(\tau)$ is equivalent to the condition $\mathcal{L}(\xi) \in \mathcal{B}_1(\tau)$. Note
that if $F\{\{x \in \mathbf{R}^d : \|x\| \le \tau\}\} = 1$ then $F \in \mathcal{B}_d(\tau)$.

Let us formulate a relation between classes $\mathcal{A}_d(\tau)$ and $\mathcal{B}_d(\tau)$. Denote by $\sigma^2(F)$
the maximal eigenvalue of the covariance operator of a distribution $F$. Then

a) If $F = \mathcal{L}(\xi) \in \mathcal{B}_d(\tau)$, then $\sigma^2(F) \le 12\,\tau^2$, $\mathbf{E}\,\xi = 0$ and $F \in \mathcal{A}_d(c\tau)$.

b) If $F = \mathcal{L}(\xi) \in \mathcal{A}_d(\tau)$, $\sigma^2(F) \le \tau^2$ and $\mathbf{E}\,\xi = 0$, then $F \in \mathcal{B}_d(c\tau)$.

If $F$ is an infinitely divisible distribution with spectral measure concentrated
on the ball $\{x \in \mathbf{R}^d : \|x\| \le \tau\}$ then $F \in \mathcal{A}_d(c\tau)$, where $c$ is an absolute constant.
It is obvious that the class $\mathcal{A}_d(0)$ coincides with the class of all $d$-dimensional
Gaussian distributions. The following inequality was proved in Zaitsev [29] and can
be considered as an estimate of stability of this characterization:

$$\text{if } F \in \mathcal{A}_d(\tau), \text{ then } \pi(F, \Phi(F)) \le c\, d^2\, \tau \log^*(\tau^{-1}), \tag{3.1}$$

where $\pi(\cdot, \cdot)$ is the Prokhorov distance and $\Phi(F)$ denotes the Gaussian distribution
whose mean and covariance operator coincide with those of $F$. The Prokhorov
distance between distributions $F$, $G$ may be defined by means of the formula

$$\pi(F, G) = \inf \{ \lambda : \pi(F, G, \lambda) \le \lambda \},$$

where

$$\pi(F, G, \lambda) = \sup_{X} \max \bigl\{ F\{X\} - G\{X^{\lambda}\},\ G\{X\} - F\{X^{\lambda}\} \bigr\}, \qquad \lambda > 0,$$

and $X^{\lambda} = \{ y \in \mathbf{R}^d : \inf_{x \in X} \|x - y\| < \lambda \}$ is the $\lambda$-neighborhood of the Borel set $X$.
Moreover, in Zaitsev [29] it was established that

$$\pi(F, \Phi(F), \lambda) \le c\, d^2 \exp\Bigl( - \frac{\lambda}{c\, d^2 \tau} \Bigr). \tag{3.2}$$

Estimates for the Strong Approximation

It is essential that inequality (3.2) is proved for all $\tau > 0$
and for arbitrary $\operatorname{cov} F$, in contrast to Theorems 2 and 3, where $\tau \ge 1$ and the
covariance operators satisfy condition (2.2). The question about the necessity of
condition (2.2) in Theorems 2 and 3 remains open. In Zaitsev [30] inequalities
(3.1) and (3.2) were proved for convolutions of distributions from $\mathcal{B}_d(\tau)$.

By the Strassen-Dudley theorem (see Dudley [9]) coupled with inequality (3.2),
one can construct on a probability space random vectors $\xi$ and $\eta$ with $\mathcal{L}(\xi) = F$
and $\mathcal{L}(\eta) = \Phi(F)$ so that

$$\mathbf{P}\{ \|\xi - \eta\| > \lambda \} \le c\, d^2 \exp\Bigl( - \frac{\lambda}{c\, d^2 \tau} \Bigr). \tag{3.3}$$

For convolutions of bounded measures, this fact was used by Rio [21], Einmahl and
Mason [13], Bovier and Mason [5], Gentz and Löwe [15], Einmahl and Kuelbs [14].
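
The step from (3.2) to (3.3) can be laid out as follows. This is only a sketch: it assumes (3.2) in the form $\pi(F, \Phi(F), \lambda) \le c\, d^2 \exp(-\lambda/(c\, d^2 \tau))$, uses the Strassen-Dudley theorem in its coupling form for laws on $\mathbf{R}^d$, and the symbol $\varepsilon$ is introduced here. The coupling obtained this way depends on the fixed value of $\lambda$.

```latex
% Step 1. By the definition of \pi(F, G, \lambda) with G = \Phi(F), inequality (3.2)
% says that for every Borel set X
\[
F\{X\} \le \Phi(F)\{X^{\lambda}\} + \varepsilon,
\qquad
\varepsilon := c\, d^2 \exp\Bigl( -\frac{\lambda}{c\, d^2 \tau} \Bigr).
\]
% Step 2. Strassen-Dudley (coupling form, laws on \mathbf{R}^d): a bound of this type
% for all closed X yields random vectors \xi, \eta on one probability space with
% \mathcal{L}(\xi) = F, \mathcal{L}(\eta) = \Phi(F) and
\[
\mathbf{P}\bigl\{ \|\xi - \eta\| > \lambda \bigr\} \le \varepsilon
  = c\, d^2 \exp\Bigl( -\frac{\lambda}{c\, d^2 \tau} \Bigr),
\]
% which is inequality (3.3).
```

In other words, a bound on the quantity $\pi(F, G, \lambda)$, which compares measures of sets, is converted into a bound on the probability that two suitably coupled random vectors are more than $\lambda$ apart.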

The scheme of the proof of Theorems 2 and 3 is very close to that of the main
results of Sakhanenko [24] and Einmahl [12]. We suppose that the Gaussian vectors
$Y_1, \dots, Y_n$, $n = 2^N$, are already constructed and construct independent
vectors which are bounded with probability one, have sufficiently smooth distributions
and the same moments of the first, second and third orders as the needed
independent random vectors $X_1, \dots, X_n$. For the construction we use the dyadic
scheme proposed by KMT [17]. First we construct the sum of $2^N$ summands
using the Rosenblatt [22] quantile transform for conditional distributions (see
Einmahl [12]). Then we construct blocks of $2^{N-1}, 2^{N-2}, \dots, 1$ summands. The rate of
approximation is estimated using the fact that, for smooth summand distributions,
the corresponding conditional distributions are smooth and close to Gaussian ones.
Then we construct the vectors $X_1, \dots, X_n$ in several steps. After each step the
number of vectors $X_k$ which are not yet constructed decreases by a factor of $2^p$, where $p$
is a suitably chosen positive integer. In each step we begin with already constructed
vectors which are bounded with probability one and have sufficiently smooth
distributions and the needed moments up to the third order. Then we construct
vectors such that, in each block of $2^p$ summands, only the first vector has the
initial bounded smooth distribution. The remaining $2^p - 1$ vectors have the needed
distributions $\mathcal{L}(\xi_k)$. These $2^p - 1$ vectors from each block will be chosen as $X_k$
and will not be involved in the next steps of the procedure. The coincidence of third
moments will allow us to use more precise estimates of the closeness of quantiles of
conditional distributions contained in Zaitsev [32]. In estimating the closeness
of random vectors in the steps of the procedure described above, we essentially use
properties of classes $\mathcal{A}_d(\tau)$.
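
On the Gaussian side, one halving step of the dyadic scheme can be written out explicitly. The following worked example is an illustration only, under the assumption that the Gaussian vectors in a block are i.i.d. with identity covariance (as in Theorem 2); the block size $2m$ and the symbols $U$, $V$, $S$ are hypothetical notation introduced here. The non-Gaussian vectors are matched to these conditional laws through the quantile transform, which is not shown.

```latex
% Y_1, \dots, Y_{2m} i.i.d. N(0, \mathbb{I}) in \mathbf{R}^d (hypothetical block of size 2m).
\begin{align*}
U &= Y_1 + \dots + Y_m, \qquad V = Y_{m+1} + \dots + Y_{2m}, \qquad S = U + V, \\
U &= \tfrac{1}{2}\, S + \tfrac{1}{2}\,(U - V), \qquad
\operatorname{cov}(S,\, U - V) = \operatorname{cov} U - \operatorname{cov} V = 0, \\
\mathcal{L}(U \mid S = s) &= N\bigl( \tfrac{1}{2}\, s,\ \tfrac{m}{2}\, \mathbb{I} \bigr).
\end{align*}
% S and U - V are jointly Gaussian and uncorrelated, hence independent;
% \operatorname{cov}\bigl( \tfrac{1}{2}(U - V) \bigr) = \tfrac{1}{4} \cdot 2m\, \mathbb{I} = \tfrac{m}{2}\, \mathbb{I}.
```

Given the block sum, the half-block sum is again Gaussian with an explicit mean and covariance, so the scheme can descend from blocks of $2^N$ summands to blocks of $2^{N-1}, 2^{N-2}, \dots, 1$ summands by repeating this conditioning step.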

4. Infinitely divisible approximation 

Let us finally mention a result about strong approximation of sums of independent
random vectors by infinitely divisible distributions. Theorem 4 below follows
from the main result of Zaitsev [32] coupled with the Strassen-Dudley theorem.
Inequality (4.1) can be considered as a generalization of inequality (3.3) to convolutions
of distributions with unbounded supports.

Theorem 4. Let $d$-dimensional probability distributions $F_i$, $i = 1, \dots, n$, be
represented as mixtures of $d$-dimensional probability distributions $U_i$ and $V_i$:

$$F_i = (1 - p_i)\, U_i + p_i V_i,$$

where

$$0 \le p_i \le 1, \qquad \int x\, U_i\{dx\} = 0, \qquad U_i\{\{x \in \mathbf{R}^d : \|x\| \le \tau\}\} = 1,$$

and $V_i$ are arbitrary distributions. Then for any fixed $\lambda > 0$ one can construct on
the same probability space random vectors $\xi$ and $\eta$ so that

$$\mathbf{P}\{ \|\xi - \eta\| > \lambda \} \le c(d) \Bigl( \max_{1 \le i \le n} p_i + \exp\Bigl( - \frac{\lambda}{c(d)\, \tau} \Bigr) \Bigr) + \sum_{i=1}^{n} p_i^2 \tag{4.1}$$

and

$$\mathcal{L}(\xi) = \prod_{i=1}^{n} F_i, \qquad \mathcal{L}(\eta) = \prod_{i=1}^{n} e(F_i),$$

where $c(d)$ depends on $d$ only and $e(F_i)$ denotes the compound Poisson infinitely
divisible distribution with characteristic function $\exp(\widehat{F}_i(t) - 1)$, where
$\widehat{F}_i(t) = \int e^{i \langle t, x \rangle} F_i\{dx\}$. If the distributions $V_i$ are identical, the term $\sum_{i=1}^{n} p_i^2$ in (4.1) can
be omitted.